Methylome and transcriptome analyses of three different degrees of albinism in apple seedlings

Background Leaf colour mutations are universally expressed at the seedling stage and are ideal materials for exploring the chlorophyll biosynthesis pathway, carotenoid metabolism and the flavonoid biosynthesis pathway in plants. Results In this research, we analysed the different degrees of albinism in apple (Malus domestica) seedlings, including white-leaf mutants (WM), piebald leaf mutants (PM), light-green leaf mutants (LM) and normal leaves (NL) using bisulfite sequencing (BS-seq) and RNA sequencing (RNA-seq). There were 61,755, 79,824, and 74,899 differentially methylated regions (DMRs) and 7566, 3660, and 3546 differentially expressed genes (DEGs) identified in the WM/NL, PM/NL and LM/NL comparisons, respectively. Conclusion The analysis of the methylome and transcriptome showed that 9 DMR-associated DEGs were involved in the carotenoid metabolism and flavonoid biosynthesis pathway. The expression of different transcription factors (TFs) may also influence the chlorophyll biosynthesis pathway, carotenoid metabolism and the flavonoid biosynthesis pathway in apple leaf mutants. This study provides a new method for understanding the differences in the formation of apple seedlings with different degrees of albinism. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08535-3.


Background
Photosynthesis is crucial for plant growth, development and survival [1]. Plants are a major organ of photosynthesis, and colour mutations in leaves can influence plant growth and development and even cause death, resulting in economic losses; therefore, most leaf colour mutants are removed individually. However, leaf colour mutation, also known as chlorophyll (Chl)-deficient mutation (CDM), is a common phenomenon in nature. CDMs are ideal materials for researching plant chlorophyll metabolism, photosynthesis systems, chloroplast development and genetic breeding [2,3].
There are many factors that lead to CDMs. Most leaf mutants are caused by genetic changes, and greater than 700 sites are involved in mutations in leaf colouration in higher plants [17]. However, the underlying mechanism is complicated. The genetic changes in CDMs can directly or indirectly influence the synthesis, degradation, content and proportions of pigments, decreasing photosynthesis and resulting in abnormal leaf colours.
Cytosine methylation is extensively found in eukaryotes and plays a vital function in many processes, including genome integrity maintenance, transcription regulation, transposable element silencing and imprinting [18]. The methylation of DNA occurs in three different sequence contexts in plants: CG, CHG and CHH (wherein H stands for A, T or C) [19]. Many mechanisms are involved in establishing, maintaining and removing DNA methylation marks. Methyltransferase 1 (MET1) and chromomethylase 3 (CMT3) maintain methylation in the CHG and CG sequence contexts, respectively [19], whereas domain-rearranged methyltransferase 2 (DRM2) is responsible for maintaining CHH site methylation [20]. Currently, some studies involving whole-genome bisulfite sequencing (WGBS) have been reported in mutations of plant colouration associated with DNA methylation, such as apple fruit somatic mutants [21] and Prunus mume flower colour chimaera mutants [22], and a few reports have involved CDMs, especially apple leaf mutants.
RNA sequencing (RNA-seq) is an important technology used in the molecular research of plants. In this method, high-throughput sequencing technology is used to sequence cDNA libraries, and RNA-seq involves the reverse transcription of total RNA in tissues or cells to reflect gene expression levels. RNA-seq is also mainly used to identify plant colour mutationrelated genes, and this method can determine the efficiencies and accuracies of these genes. The identification, cloning, and functional research of mutant genes and their associations with bisulfite sequencing (BS-seq) are of great significance in explaining the molecular mechanism of CDMs in plants [17].
Apples (Malus domestica) are extensively cultivated and economically significant perennial fruit crops. Most colour mutants focus on the colours of apple fruits [21], and few studies have examined the CDMs of apples. We used the methylome and transcriptome to analyse three different CDMs in apple rootstock G.935. Methylation and transcription analyses of apple seedlings with different degrees of albinism indicated that nine genes associated with carotenoid metabolism and the flavonoid biosynthesis pathway were differentially methylated and may be regulated by methylation changes and transcription factors (TFs). The results provide a new perspective for understanding the three different degrees of albinism in apple plants.

Phenotypic characteristics of apple leaves
Different CDMs of apple leaves presented significantly white, piebald and light-green leaf colours (Fig. 1A). The contents of Chl a, Chl b, Chl a + b and carotenoid in the apple leaf mutants were lower than those in normal leaves (NL) (Fig. 1B). The order of these contents was NL > light-green leaf mutants (LM) > piebald leaf mutants (PM) > white-leaf mutants (WM), demonstrating that the decreased Chl a, Chl b and carotenoid concentrations might be the key reason for the apple CDMs. As the photosynthetically active radiation (PAR) increased (Fig. 1C), the light-saturated net photosynthetic rates (Pn) of mutant leaves and normal leaves were significantly different. NL showed the highest light-saturated Pn value, and that of PM was the lowest followed by LM. The WM leaves had no measured Pn value.

Whole-genome bisulfite sequencing of different apple leaves
According to different CDMs in the apple seedlings, we prepared three pairs of libraries (WM/NL, PM/NL and LM/NL) from different apple leaves showing different leaf mutations. In the BS-Seq analyses, approximately 10,000,000 clean reads were filtered from each sample.  (Table S1). Figure S1A shows the cumulative distribution of the C-base effective sequencing depth. The fraction containing the CG, CHG and CHH sequence contexts approached 100% with the minimum depth. The fraction peaked at approximately 10 to 20 times the read depth of the additional peak at the minimum read depth (Fig. S1A), indicating that the quality of the sequencing data was high. The sequence depth distributions of the four apple leaves are described in Fig.  S1B. The data in Fig. S1C indicate the relative proportions of methylcytosines (mCs) in CG, CHG and CHH sequence contexts of four apple leaves, and the mC frequencies were roughly equal; the CG, CHG and CHH sites were each approximately 30%. Based on a genomewide perspective, the methylation landscapes of four apple genomes were apparent and are shown in Fig. S2. The mC distribution in all 17 apple chromosomes in all sequence contexts is presented in Fig. S3.

DNA methylation patterns among different apple seedlings
The methylation profiles in gene regions were analysed to investigate the DNA methylation patterns in the mutant and wild-type apple leaves, as shown in Table S2. In each gene region of the four different leaves, the methylation level at CG was highest, followed by CHG and C ( Fig. 2A). To explore the DNA methylation patterns in different apple mutant leaf genomic regions, we analysed the methylation profiles within genes. Methylation at CG was present at the highest levels in upstream regions, followed by downstream regions, introns and exons. Less methylation occurred at CG and CHH sites than at CHG sites in the upstream and downstream regions and first exons. CG sites had higher methylation levels in internal exons, introns and the last exons than CHG and CHH sites. The CHH sites showed the lowest methylation levels in all regions (Fig. 2B).

Differentially methylated regions in different apple leaves
Differentially methylated regions (DMRs) were identified to research the extent of differential methylation among the four apple leaves. Table S3 lists the distribution of the lengths of DMRs and the number of each chromosome. A total of 61,755 DMRs were detected between WM and NL (WM/NL), 79,824 were detected between PM and NL (PM/NL), and 74,899 were detected between LM and NL (PM/NL), as shown in Table S3.
The DMRs were divided into DMR-associated genes and promoters, and the DMRs overlapped with the genes and promoters. A total of 9380 DMR-associated genes were found in WM/NL, 12,161 were found in PM/ NL, and 10,486 were found in LM/NL (Fig. 3A). A total of 19,089 DMR-associated promoters were identified in WM/NL, 22,988 were identified in PM/NL, and 22,599 were identified in LM/NL (Fig. 3B). There were more hyper-DMR-related genes than hypo-DMR-related genes in both WM/NL and PM/NL, and PM/NL also had more hyper-DMR-related promoters than hypo-DMR-related promoters; the opposite patterns were observed in DMRrelated genes and promoters of LM/NL and DMR-related promoters of WM/NL (Fig. 3B). Based on Gene Ontology (GO) analysis, DMR-associated genes were involved in different biological processes, such as catalytic activities, metabolic processes, cellular processes, single-organism processes and binding (Fig. 3C). The DMR-associated The data are presented as the means of 3 replicates with the SD promoters revealed genes involved in metabolic processes, including catalytic activities, cellular processes, single-organism processes, binding and cell partitioning (Fig. 3D). As reflected by the analyses, the DMRs in the Kyoto Encyclopedia of Genes and Genomes (KEGG) were abundant in pathways involving flavonoid biosynthesis, flavone and flavonol biosynthesis and photosynthesis-antenna proteins (Fig. S4).

Differential gene expression among the four apple leaves
To discover the global gene expression among the different apple leaves, RNA-seq analyses were performed for the four apple leaves (each leaf was assessed using 3 biological replicates). A summary of the sequencing data is listed in Table S4. Principal component analysis (PCA) showed that the NL, LM, PM and WM samples were separately aggregated, indicating that significant differences existed in the gene expression profiles. The four leaves replicates did not strictly cluster together, suggesting that inoculation within the replicates differed (Fig. S5). After removing low-quality reads, the differentially expressed genes (DEGs) were identified with a fold change ≥1 and p-value < 0.05. Fig. S6A In the GO analysis, the DEGs were divided into 37 functional terms, such as biological process, cellular component and molecular function. Most GO term genes of those functional terms were downregulated in WM/NL and PM/NL, and upregulated in LM/NL. The biological process genes were enriched in metabolic, cellular and single-organism processes (Fig. S7A). Regarding their biological functions, KEGG analysis showed that the DEGs were observably involved in flavonoid biosynthesis, DNA replication, metabolic pathways and secondary metabolite biosynthesis pathways (Fig. S7B).
Among the DEGs, a total of 1227 DEGs were found in all comparisons (Fig. 4A), which were involved in metabolic processes, genetic information processing, environmental information processing, cellular processes and organismal systems (Fig. 4B). Enrichment analysis of the DEGs showed that these genes were significantly enriched in metabolic pathways, secondary metabolite biosynthesis and flavonoid biosynthesis (

Analysis of the DEGs related to the chlorophyll biosynthesis pathway, carotenoid metabolism and flavonoid biosynthesis pathway
Leaf pigments, such as chlorophyll, carotenoids and flavonoids, can influence plant leaf colouration [3]. Therefore, the DEGs involved in these pathways were selected to investigate differential gene expression among different leaf colourations (Table S5) and MD15G1024100), LAR (MD06G1211400, MD13G1046900 and MD16G1048500) and ANS (MD03G1001100, MD06G1071600 and MD07G1222600) in flavonoid biosynthesis pathways (Fig. 5C).
The green leaf colouration is due to chlorophyll accumulation. Chlorophyllide and oxygenase (CAO) gene expression was decreased 1.42-, 1.69-and 0.21-fold in WM, PM and LM, respectively, compared with that in NL. The synthesis of 5-aminolevulinic acid (ALA) is a rate-limiting step in chlorophyll biosynthesis. Reduced expression of the synthetic gene glutamyl-tRNA reductase (HEMA) may affect chlorophyll metabolism in WM and LM. Mg-protoporphyrin IX monomethyl ester (CRD1) gene expression was upregulated in the mutant apple leaves in WM and PM. The transcription factor MdMYB308 (MD07G1153200) was downregulated in three mutant apple leaves. Therefore, the downregulated genes might influence the chlorophyll metabolism of mutant apple leaves.

Correlation analysis between DMRs and DEGs in different apple leaves
The correlation between DMRs and DEGs on the wholegenome scale was assessed to investigate the influence of DNA methylation on gene expression. A total of 263,066 DMRs were found between the mutant leaves and the normal leaves. Approximately 51.89% of the DMRs were distributed in the distal intergenic regions, and 30.38% of the DMRs were found in promoter regions (Fig. 6A). The number of hyper-DMRs was approximately equal to the number of hypo-DMRs in WM/NL, and the number of hyper-DMRs was higher than that of hypo-DMRs in PM/NL and the opposite in LM/NL (Fig. 6A). In WM/ NL, 74,840 DMRs (37,717 hypermethylated and 37,123 hypomethylated DMRs) were found, and these DMRs overlapped with 16,033 hypermethylated and 15,858 hypomethylated genes. In PM/NL, 98,020 DMRs (70,511 hypermethylated and 27,509 hypomethylated DMRs) were selected and overlapped with 27,552 hypermethylated and 11,973 hypomethylated genes. In LM/NL, 90,206 DMRs (31,708 hypermethylated and 58,498 hypomethylated DMRs) were found, and these regions overlapped with 14,258 hypermethylated and 22,580 hypomethylated genes (Fig. 6B).
By combining the results obtained from the different mutants, 52,008 and 6776 differentially methylated genes (DMGs) were identified (Fig. 6C). GO analysis was performed on the DMGs to understand the possible role of DNA methylation in the different apple leaves. These results indicated notable differences in biological processes, cellular components and molecular functions during the greening stage (Fig. 6D). Figure  S8 shows the number of DMGs at CG, CHG, and CHH sites between DMRs and DEGs in WM/NL, PM/NL and LM/NL. Methylation at CHH sites was noted in most genes between DMRs and DEGs, followed by methylation at CG and CHG sites. Figure S9 indicates the gene expression patterns and methylation levels. Although the expression levels of hyper-DMR genes and all genes were slightly higher than the expression levels of hypo-DMR genes in the expression of DMR-associated genes and promoters, there was no statistically significant difference between hypomethylated or hypermethylated genes and all genes. There was also no obvious difference in the hypermethylated or hypomethylated with DEGassociated genes or promoters among the three different mutant comparisons.

DMR-associated DEGs are involved in the carotenoid and flavonoid biosynthesis pathways
We also focused on the DMR-associated DEGs involved in the chlorophyll, carotenoid and flavonoid biosynthesis pathways (Fig. 5 and Table S6). There were no DMRassociated DEGs involved in the chlorophyll biosynthesis pathways. In the carotenoid biosynthesis pathway, PDS (MD15G1038500) gene expression was decreased in mutant leaves compared with normal leaves, and the PDS promoter was hypermethylated in PM/NL. In the flavonoid biosynthesis pathway, the PAL (MD04G1096200) gene exhibited decreased expression and a hypermethylated gene body in PM/NL. In WM/NL, LM/NL and PM/ NL, CHS (MD04G1003000) was downregulated with a hypermethylated promoter. In WM/NL, LM/NL and PM/NL, CHI (MD07G1186300 and MD07G1233400) exhibited decreased expression and a hypermethylated gene body. In PM/NL, CHI (MD01G1167300) exhibited decreased expression and hypermethylated gene bodies. The expression of DFR (MD15G1024100) was also decreased, and its gene body and promoter were hypermethylated in PM/NL. In PM/NL and LM/NL, LAR (MD16G1048500) expression was downregulated with a hypermethylated gene body. In PM/NL, ANS (MD03G1001100) downregulated gene with a hypermethylated promoter was observed.

Confirmation of DNA methylation and gene expression data
Several DMR genes (MD07G12334000, MD15G1024100, MD04G1003000, MD03G1001100 and MD13G1257800) involved in the carotenoid and flavonoid biosynthesis pathways were selected for bisulfite-PCR and sequencing analyses. Using four apple leave samples, these regions were subjected to bisulfite-PCR and then sequenced. The bisulfite-PCR results were consistent with the corresponding methylome analysis (Fig. S10), indicating that the sequencing results were reliable.

Discussion
Colour mutation in leaves is an easily identifiable and common phenomenon in higher plants. CDMs frequently affect photosynthetic plant efficiency, causing plant growth delay and economic losses. For this reason, CDMs have been negatively selected in the past. More recently, the use of CDMs has become more common. Mutants are ideal materials for researching chloroplast differentiation and development, pigment metabolism, photosynthesis and other pathways; mutants also supply significant information for improving varietal selection [17].
Apples are a widely cultivated and economically significant perennial fruit. Most research has focused on the colour of apple flesh [21], and the CDMs of apples are rarely involved. In this study, we analysed differences in the methylome analyses and transcriptome analyses of three different natural CDMs of G.935 apple. We provide a reference for future research and application of CDM.
In plants, the DNA methylation machinery may affect growth, development, the regulation of biosynthesis and phenotypic plasticity [25][26][27]. Therefore, it is important to explain the differences in the methylomes of DNA between CDMs and normal apple leaves to understand epigenetic regulation. The DNA methylation dynamics were investigated at a single-base resolution through WGBS in four apple leaves (NL, WM, PM and LM). The genome showed methylation levels of 62.85-67.00%, 44.52-49.25%, and 6.73-9.05% at the CG, CHG and CHH sites, respectively (Table S7). The methylation of (See figure on next page.) CG was found to be more common than the methylation of CHG or CHH sites in previous methylome studies of apple plants [21]. In the four apple leaves studied here, methylation at CG, CHG and CHH sites followed the same trend; methylation was the most common at CG sites, followed by CHG and CHH sites. Among the four leaves, PM had the highest methylation levels, and NL had the lowest methylation levels. This finding indicated that the methylation level in the normal leaves was relatively stable; on the other hand, the PM leaves exhibited substantial changes in their methylation levels. Previous research indicated that the level of DNA methylation positively correlates with genome size [28,29]. Compared with other plants, such as Arabidopsis, Eutrema salsugineum, Vitis vinifera and Beta vulgaris, apple plants show middling methylation levels [29]. These results indicated that DNA methylation primarily functions to maintain genome stability [21].
As an epigenetic marker, DNA methylation plays a role in repressing gene expression. When first conducting genome-wide methylome analyses, the relationship between DNA methylation and gene expression was found to be more complicated than initially believed [3]. For example, in rice and Arabidopsis, promoter methylation restrains the expression of genes only at highly methylated gene loci, and gene body methylation is always positively related to gene expression [30,31]. Global methylation and transcriptional analyses indicated that nonexpressed genes or genes expressed at low levels were highly methylated in the gene body regions, suggesting that gene body methylation negatively correlates with gene expression. Moreover, genes with methylated promoters were more highly expressed than genes with unmethylated promoters, implying that promoter methylation and gene expression are positively correlated [32].
For certain DMR-associated DEGs, the opposite pattern was observed. In the carotenoid biosynthesis pathway, the PDS gene (MD15G1038500) was downregulated with hypomethylated promoters in PM/NL. The analyses also emphasized eight DMR-associated downregulated DEGs related to the flavonoid biosynthesis pathway. Specifically, PAL (MD04G1096200) exhibited decreased expression and a hypomethylated gene body in PM/NL. In WM/NL, LM/NL and PM/NL, CHS (MD04G1003000) represented downregulated genes with hypermethylated promoters. In WM/NL, LM/NL and PM/NL, CHI (MD07G1186300, MD07G1233400) exhibited decreased expression and a hypomethylated gene body. In PM/ NL, CHI (MD01G1167300) was decreased and had a hypomethylated gene body. DFR (MD15G1024100) also showed decreased expression with a hypermethylated gene body and promoter in PM/NL. In PM/NL and LM/ NL, LAR (MD16G1048500) exhibited downregulated expression with a hypermethylated gene body, and ANS (MD03G1001100) exhibited downregulated expression with a hypomethylated promoter. In general, the expression of many genes was not associated with the corresponding methylation level. Although some downand upregulated genes were related to hyper-and hypo-DMRs, many DEGs did not show remarkably different methylation levels. DNA methylation influences the expression of genes and is likely influenced by a combination of direct and indirect mechanisms [21]. The bisulfite-PCR and RT-qPCR results were consistent with the corresponding methylome analysis and RNA-seq data ( Fig. S10 and S11), indicating that the sequencing results were reliable.
Plant leaf colouration is also influenced by the pigments in leaves, such as chlorophyll, carotenoids and flavonoids [3]. Leaf colouration is also affected by the chlorophyll, carotenoid and flavonoid biosynthesis pathways. In plant leaves, the chlorophyll and carotenoid contents are the main colouring matter that capture light energy. The colouration of plant leaves is mainly influenced by the biosynthesis and transport of chlorophyll and carotenoids [15,33]. Many albino leaves have conspicuously decreased chlorophyll levels [34,35]. In this research, less Chl a and Chl b were detected in the CDMs than in NL (Fig. 1A). Chl b biosynthesis regulates light-harvesting efficiency and photosynthetic antenna size [36]. The decreased chlorophyll contents observed in the CDMs indicate that the mutants might have less light-harvesting efficiency than NL and that this factor may affect leaf colouration. The expression of CAO, CRD1, HEMA and the transcription factor MdMYB308 was decreased, which may induce low Chl b levels in the three mutant leaves (Fig. 5A). MdMYB308 (MD07G1153200) was also downregulated in three mutant apple leaves. The homologous gene CsMYB36 (Csa2G352940) is involved in the formation of the yellow-green peel in Cucumis sativus [37], and a CsMYB36 mutation could affect the synthesis of L-glutamate-tRNA to 5-aminopentanone, resulting in low chlorophyll levels in the pericarps of yellow-green peel mutants and affecting the chlorophyll content.
The pigments of carotenoids are red, orange, or yellow. Carotenoid biosynthesis is balanced with chlorophyll synthesis in the chloroplast, and carotenoids are also essential for photoprotection [38]. Various carotenoids exhibit changes in composition, especially displaying markedly increased zeaxanthin levels, leading to albino tea cultivars [35] and yellow-leaf mutants of winter wheat [33]. The total carotenoid contents in NL, WM, PM and LM were obviously different, and the carotenoid contents in mutants were significantly decreased compared with those in normal leaves (Fig. 1A). In particular, NCED is a rate-limiting enzyme that controls carotenoid transformation to ABA, and NCED gene expression was upregulated in WM, PM and LM (Fig. 5B) and may influence carotenoid degradation and contribute to carotenoid component changes.
Flavonoids, including anthocyanins, flavones and flavonols, also affect leaf colouration [15]. Structural gene expression in the flavonoid metabolic pathway can influence plant colouration. Under light induction, the activity of PAL and anthocyanin levels increase in purple-foliage plum leaves, and the colour of leaves gradually turns purplish red [39]. CHS is also a key enzyme in the anthocyanin biosynthetic pathway. When the CHS1 gene of Freesia hybrida was incorporated into Petunia hybrida as a transgene, the colour of P. hybrida changed from white to pink [40]. Silencing of the CHI gene can induce yellow pigmentation in tobacco and carnation [41]. Moreover, the oxidation of downstream colourless proanthocyanin is regulated by ANS into the coloured anthocyanin. The genes and transcription factors in the flavonoid metabolic pathway, including PAL, 4CL, CHS, CHI, DFR, LAR, ANS and MdbHLH3 were decreased in apple with different CDMs (Fig. 5C). The increased methylation of these genes and the regulation of transcription factors may be responsible for the downregulation of the genes and influence the phenotypes of apple leaves.

Conclusions
The methylomes and transcriptomes of apple seedlings with three different degrees of albinism were analysed, and DMR-associated DEGs in carotenoid metabolism and the flavonoid biosynthesis pathway were identified in three comparisons (WM/NL, PM/NL and LM/ NL). The DEGs included PDS (MD15G1038500), PAL (MD04G1096200), CHS (MD04G1003000), CHI (MD01G1167300, MD07G1186300, MD07G1233400), DFR (MD15G1024100), LAR (MD16G1048500) and ANS (MD03G1001100), and had hypermethylated promoter or gene body regions. The expression of different transcription factors may also influence the chlorophyll, carotenoid and flavonoid biosynthesis pathways. The differential methylation of structural gene promoters and transcription factors affected different biosynthesis pathways in the apple leaf mutants, which may explain the induction of white-leaf mutants, piebald leaves and lightgreen leaves in apple leaves.

Plant materials and growth conditions
Seedlings of apple rootstock G.935 with different degrees of albinism selected by natural variability and wild type G.935 were used in this study. G.935 plants with three different degrees of mutation and wild-type plants were grown at 25 °C in a greenhouse with a 16-h light/8-h dark photoperiod for two months. The virescent leaves of the mutant plant were named light-green leaves (LM), the zebra leaves were named piebald leaf mutants (PM), and the albino leaves were named white-leaf mutants (WM). Wild-type G.935 was treated as a control and termed normal leaves (NL). After the determination of net photosynthesis, the chlorophyll contents and carotenoid contents, the mature leaves from the three mutants and the NL at 9 to 12 positions along the stem base were sampled between 10 and 11 am, immediately frozen in liquid nitrogen, and used for the WGBS and RNA-seq analysis.

Photosynthetic measurements
Leaves were collected from 4 different apple plants to detect different physiological indexes. Ninety-five percent ethyl alcohol was used to extract Chl a, Chl b, Chl a + b and carotenoids, and the concentrations were determined spectrophotometrically as described by Arnon [42].
The Pn of apple leaves was monitored on sunny days as described by Sun et al. [43]. PAR was gradually decreased stepwise using an integrated light-emitting diode (LED) light source from 2000 to 0 μmol m − 2 s − 1 , and the Pn was recorded at each PAR point when it was stable.

WGBS analysis
Apple DNA genomes were extracted from mutant and wild-type leaves using a Plant Genomic DNA Purification Kit (Tiangen, Beijing, China). Sonication was performed to fragment the DNA into 100-to 300-bp fragments (Covaris, MA, USA). The genomic fragments were ligated with adapters, converted with bisulfite using a Methylation-Gold kit (Zymo, CA, USA), and sequenced by using an Illumina HiSeq ™ 2500 from Gene Denovo Biotechnology Co. (Guangzhou, China).
For the data analysis, BSMAP software was used to map the clean reads back to the apple reference genome. The sliding-window approach with a 200-bp window slide at 50-bp intervals was used for each sequence context to identify the DMRs. DMRs for each sequence context (CG, CHG and CHH) between sample 1 and sample 2 were determined according to different criteria: at least five methylated cytosine sites; coverage of more than ten reads; distance between adjacent methylated sites < 200 bp; methylation level of greater than 20%; and Fisher's exact test p-values < 0.05 and false discovery rate (FDR) < 0.05 using Methylkit (V1.4.1) [44]. The DMRassociated genes and promoters were characterized after the DMRs were identified. Gene regions were classified into the promoter (2 kb upstream from transcription start sites), exon, intron, upstream (up to 2 kb gene start) and downstream (up to 2 kb gene end) regions.
GO analysis was performed with GO::Term Finder software [45] and the KEGG enrichment tool (http:// www. kegg. jp/ kegg/) [46] to study the DMR-associated genes and promoters. Bonferroni correction was used to calculate the p-value. GO terms with corrected p-values < 0.05 were regarded as greatly enriched by DMR-associated genes or promoters. KEGG pathways with p-values < 0.05 were considered significantly enriched with DMRrelated genes.

RNA extraction and the transcriptome sequencing
The total RNA of the apple leaves was separated using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) for transcriptome analysis, and three biological replicates were used for each sample. For library construction, 5 μg of RNA per sample was used. The libraries were sequenced using an Illumina HiSeq2500 from Gene Denovo Biotechnology Co. (Guangzhou, China). After rapid filtering [47] (version 0.18.0), the clean reads were compared with the apple genome (https:// iris. angers. inra. fr/ gddh13/ index. html) by HISAT2.2.4 and Bowtie2 tools [48,49]. For each transcription region, the fragment per kilobase of transcript per million mapped reads (FPKM) values were calculated using RESM software [50]. DESeq2 software was used to identify the DEGs [51] based on a fold change ≥2 and divergence probability ≥0.8 [52]. The DEGs were analysed according to the default parameters by GO and KEGG enrichment, and GO terms with corrected p-values < 0.05 were considered significantly enriched by DEGs.

Bisulfite sequencing PCR (BSP) analysis
Samples of gDNA (750 ng) from different apple leaves, including WM, PM, LM and NL, were treated with an EZ DNA Methylation-Gold Kit (Zymo Research) according to the method of Jiang et al. [21]. Each fragment was subjected to three independent PCRs to produce 12 independent clones for sequencing. The online software Kismeth was used to analyse the results (http:// katah din. mssm. edu/ kisme th). The primers used for BSP-PCR are listed in Table S8.

Reverse transcription and qRT-PCR
A Revert Aid First Strand cDNA Synthesis Kit (Thermo Scientific, Waltham, MA, USA) was used to reverse transcribe 1 μg RNA with a CFX96 instrument (Bio-Rad, Hercules, CA, USA) and SYBR ® Premix Ex Taq ™ II (Takara, Dalian, China) to perform quantitative real-time reverse transcriptase PCR (qRT-PCR). The gene elongations factor 1α in M. domestica (EF-1α; DQ341381) was used to standardize different genes in the cDNA samples [53]. The 2 -ΔΔCT method was used to compute the relative expression level of each gene [54]. Three biological samples were used in all experiments. The primers used for qRT-PCR are listed in Table S8.

Statistical analysis
Three biological and three technical replicates were included in all experiments. The data are presented as the mean ± SD. The significant differences among groups were evaluated using Duncan's multiple-range test. The p-values < 0.05 were considered statistically significant.

Supplementary Information
The online version contains supplementary material available at https:// doi. org/ 10. 1186/ s12864-022-08535-3. Table S1. Information on the bisulfite sequencing libraries. Table S2. The coverage of the intergenic region. Table S3. The DMRs distributed on each chromosome. Table S4. RNA-seq sequencing data and mapping of the genome. Table S5. Differentially expressed genes involved in the flavonoid, chlorophyll and carotenoid biosynthesis pathways in WM/NL, PM/NL and LM/NL. Table S6. DMR-associated DEGs involved in the carotenoid and flavonoid biosynthesis pathways. Table S7. The average methylation levels in NL, WM, PM and LM. Table S8. The primers used in qRT-PCR and BS-PCR. Fig. S1. Information of the sequencing and mapping. Fig. S2. The global methylome in NL, WM, PM and LM.

Availability of data and materials
The methylome data and transcriptome data have been deposited into the National Center for Biotechnology Information (NCBI) databases, and the bioproject accession number is PRJNA705949 (https:// www. ncbi. nlm. nih. gov/ sra/ PRJNA 705949).

Declarations
Ethics approval and consent to participate Seedlings of apple rootstock G.935 were purchased from the Vivai F.lli Zanzi company for scientific research, and the different degrees of albinism seedlings were selected by natural variability. The authors complied with international, national and institutional guidelines.

Consent for publication
Not applicable.